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We present a measurement of the top quark pair production cross section in pp collisions at 
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y/s — 1.96 TeV utilizing 425 pb~^ of data collected with the DO detector at the Fermilab Tevatron 
Collider. We consider the final state of the top quark pair containing one high-pr electron or muon 
and at least four jets. We exploit specific kinematic features of tt events to extract the cross section. 
For a top quark mass of 175 GeV, we measure cr^-j = 6.4^ j j (stat) ± 0.7 (syst) ± 0.4 (lum) pb, in 
good agreement with the standard model prediction. 

PACS numbers: 13.85.Lg, 13.85.Qk, 14.65.Ha 



I. INTRODUCTION 

Since the discovery of the top quark in 1995 by the 
CDF and DO experiments [l| , the Fermilab Tevatron pp 
Cohidcr with its center-of-mass energy of y/s = 1.96 TeV 
is still the only collider where top quarks can be stud- 
ied. Within the standard model, top quarks arc produced 
either in pairs via strong interactions or as single top 
events via electroweak interactions with a lower expected 
cross section Q ■ Evidence for the latter production mode 
has been recently found by the DO collaboration 0]. At 
the current Tevatron Collider center-of-mass energy, top 
quark pair production is predicted to occur via qq an- 
nihilation or gluon fusion with a ratio of approximately 
85:15. 

The tt pair production cross section was measured in 
various channels during Run I of the Fermilab Tevatron 
Collider at a center-of-mass energy of y's = 1.8 TeV [1]. 
The precision of these measurements was severely limited 
by available statistics. The 10% higher collision energy of 
the current Tevatron Collider run leads to a 30% higher 
expected top quark pair production rate; together with 
an increased luminosity, the precision on measurements 
of the top quark production and decay properties can 
therefore be substantially increased. The latest theoreti- 
cal calculations d, Hi, J} of the tt production cross section 
at next-to-next-to- leading order (NNLO) have an uncer- 
tainty ranging from 9% to 12%. Recent measurements 
with a data set approximately twice as large as in Run 
I [1, Q are consistent with these predictions within the 
uncertainties. 

Deviations from the standard model could occur due 
to the presence of new physics, such as resonant tt pro- 
duction [lo| . a novel top quark decay mechanism, as for 
example, t H'^b [ll| or a similar final state signature 
from a top-like particle Some of these effects could 
cause the inclusive tt cross section {<Jtt) to be different 
from the standard model prediction. Others could cause 
differences in top decay branching fractions, thus lead- 
ing to a{tt) measured in different decay channels to dis- 
agree with the expectations computed using the standard 
model branching fractions. Therefore measurements of 
Utt in different top quark decay channels and using dif- 
ferent analysis methods complement each other. 

In this paper we present a new measurement of the 
top quark production cross section in the ^-|-jets channel, 
where one of the W bosons decays hadronically, and the 
other one leptonically into an electron {W ev) or a 
muon {W fiiy). W boson decays into a r lepton with 
a subsequent decay of the latter into an electron or a 



muon are included in the signal sample. Each of the two 
decay channels represent approximatively 17% of the to- 
tal top quark pair production and decay. We exploit the 
kinematic properties of the events to separate ti signal 
from M^-l-jets background, instead of the often-exploited 
requirement of a final-state separated vertex that is con- 
sistent with the b decay. This choice makes this measure- 
ment less dependent on the assumption that a top quark 
decays into a 6-quark. 

The measurement is based on a data sample taken be- 
tween August 2002 and August 2004 with an integrated 
luminosity of 425 pb~^, which represents approxima- 
tively a factor two increase with respect to the previously 
published measurement by the DO experiment ^Sj]. 

After a short description of the relevant DO detector 
parts and underlying object identification algorithms, we 
describe the data and Monte Carlo samples, the event se- 
lection, the background determination and the procedure 
to extract the top quark signal. Finally, we discuss the 
systematic uncertainties associated with the cross section 
measurement. 



II. DO DETECTOR 

The DO detector [isj is a nearly hermetic multi-purpose 
apparatus built to investigate pp interactions at high 
transverse momentum. The measurements reported here 
rely on the tracking system, the Uranium-Liquid Argon 
calorimeter, the muon spectrometer and the luminosity 
detectors, which are briefly described below. The co- 
ordinate system is right handed with the z axis along 
the Tevatron proton beam direction, the y axis vertical 
and the x axis pointing outside of the accelerator ring. 
The coordinates are also expressed in terms of the az- 
imuthal angle ip, rapidity y and pseudorapidity rj. The 
latter are defined as functions of the polar angle 9 as 
yi9,P) = iln[(l-f /3 cos ^)/(l-/3 cos 0)]; 77(0) = y(0,l), 
where (3 is the ratio of particle momentum to its energy. 
When the center of the DO detector is considered as the 
origin of the coordinate system, these coordinates are 
referred to as detector coordinates (pdct and ?7det; when 
the reconstructed interaction vertex is considered as the 
origin of the coordinate sytem, these coordinates are re- 
ferred to as physics coordinates ip and rj. 

The tracking system includes the Silicon Microstrip 
Tracker (SMT) and the Central Fiber Tracker (CFT). 
A superconducting solenoid surrounds the tracking sys- 
tem and provides a uniform magnetic field of 2 T. The 
SMT is a system closest to the beam pipe. It has six 
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barrels in the central region of |?7dot| < 1-5, each barrel 
is 12 cm long and capped at high \z\ by a disk with an 
external radius of 10.5 cm. Each barrel has four silicon 
readout layers, composed of two staggered and overlap- 
ping sub-layers. Each small-radius disk is composed of 
twelve double-sided wedge-shaped detectors. Track re- 
construction in the forward region up to |77dct| < 3 is 
provided by two units composed of three small and two 
large radius disks located at \z\ = 44.8, 49.8, 54.8 cm 
and 110, 120 cm respectively. Large radius disks are 
composed of 48 single-sided wedges with an external ra- 
dius of 26 cm. The CFT consists of 8 concentric cylinders 
and covers the radial space from 20 to 52 cm. The two 
innermost cylinders are 1.66 m long, and the outer six 
cylinders are 2.52 m long. Each cylinder supports two 
doublets of overlapping scintillating fibers with the di- 
ameter of 0.84 mm, one doublet being parallel to the 
beam axis, the other with an alternating stereo angle of 
±3°. Light signals are transfered via clear optical fibers 
to solid-state visible hght photon counters (VLPCs) that 
have a quantum efficiency of about 80%. Tracks are re- 
constructed combining the hits from both tracking detec- 
tors. 

The calorimeter is used to reconstruct jets, electrons, 
photons and missing transverse energy of non-interacting 
particles such as neutrinos. The DO Uranium-Liquid Ar- 
gon calorimeter which surrounds the tracking system is 
divided into the Central Calorimeter (CC) up to |?7dot| — 
1.0 and two Endcap Calorimeters (EC) extending the 
coverage to \ridet\ — 4.0, housed in separate cryostats. 
Each calorimeter consists of an electromagnetic section 
with depleted Uranium absorber plates, a fine hadronic 
section with an Uranium-Niobium absorbers and a coarse 
hadronic section with Copper (Stainless Steel) absorbers 
in the CC (EC). The calorimeter is compact and highly 
segmented in the transverse and the longitudinal direc- 
tions with about 56,000 channels in total. In (/s, the 
electromagnetic part is divided into 64 modules and the 
hadronic part into 32 modules. The electromagnetic part 
has a depth of about 20 radiation lengths (Xq); and with 
the hadronic sections, the calorimeter has a total of 7.2 
nuclear interaction length (A/) at 77 = and of 10. 3A/ 
at I77I ~ 4. The intcr-cryostat region is equipped with 
scintillation detectors (intcr-cryostat detectors or ICD) 
to improve energy resolution. 

The muon system is the outermost part of the DO de- 
tector. It consists of three layers of tracking detectors 
used for precise coordinate measurements and triggering 
and two layers of scintillation counters used for trigger- 
ing [l3|. Proportional drift tubes (PDT) cover the central 
region (|?7dot| < 1-0), and mini drift tubes (MDT) extend 
the coverage to |77dct| = 2.0. One layer of scintillation 
counters in the central region and two layers in the for- 
ward region (1.0 < |77dct| < 2.0) along with two layers 
of drift tubes (B and C layers) are located outside of a 
1.8 T iron toroid while the innermost layers (A) of muon 
tracking detectors and scintillators arc located in front 
of it. The support structure underneath the DO detector 



allows only for partial coverage in this region. 

The luminosity is determined from the rate of inelas- 
tic collisions measured by the luminosity monitors (LM) 
located in front of the ECs at 2 = ±140 cm. The LM 
consists of two arrays of 24 plastic scintillator counters 
with photomultiplier readout and covers the pseudora- 
pidity range |?7dot| between 2.7 and 4.4. The uncertainty 
on the luminosity measurement is ± 6.1% [Tst and is 
dominated by the uncertainty on the pp inelastic cross 
section. 



III. OBJECT IDENTIFICATION 

A. Primary vertex 

The primary (or hard scatter) vertex of the event is 
reconstructed in three steps. At the first step, we lo- 
cate a beam spot position using reconstructed tracks 
with the transverse momentum of pT > 0.5 GeV. These 
tracks should have at least two hits in the SMT detector 
and the significance of the distance of closest approach 
Sdca. = \dca/(Jdca.\ < 100. The distance of closest ap- 
proach (dca) is calculated with respect to the center of 
the detector in the plane transverse to the beamline. At 
the second step, we impose a more stringent requirement 
on the tracks, S'dca < 3, where S'dca is calculated with re- 
spect to the beam spot determined in the previous step. 
These tracks are then used to fit the final primary ver- 
tices. We use information on the position of these tracks 
along the beamline to identify tracks belonging to differ- 
ent interactions and build clusters of the tracks within 2 
cm from each other. All tracks in each cluster are fitted to 
a common vertex using the Kalman filter technique . 
Finally, to distinguish the position of the hard scatter 
interaction from the simultaneously produced minimum 
bias scatters, a minimum bias probability is computed for 
each reconstructed vertex based on the transverse mo- 
menta and the total number of associated tracks. The 
primary vertex with the lowest minimum bias probabil- 
ity is selected as the hard scatter. 

The primary vertex finding algorithm reconstructs ver- 
tices in the fiducial region of the SMT with an efhciency 
close to 100%. The position resolution, measured in data 
as a difference between the reconstructed vertex position 
and the position of the beam spot center, depends on 
the number of tracks fitted to the primary vertex and is 
around 40 ^m in the plane transverse to the beam di- 
rection. It is dominated by the beam spot size of about 
30 /im. 

For the analysis, we select events with the primary 
vertex within the SMT fiducial region Izpvl < 60 cm and 
at least three tracks attached to the vertex. 
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B. Electrons 

The electron identification is based on clusters of 
calorimeter cells found in the CC within \r]det\ < 1-1 us- 
ing a simple cone algorithm with a cone size oi TZ ^ 

{Ar])'^ + (Aiy9)2 = 0.2. A cluster is considered as a 
"loose" electron if (i) at least 90% of its reconstructed 
energy is in the electromagnetic part of the calorimeter 
(/em > 0.9), (a) the cluster is isolated, (m) its shower 
shape is consistent with an electromagnetic shower and 
{iv) there is at least one track in a Arj x A(p road of 
size 0.05 X 0.05 around the cluster. The angular coordi- 
nates ry, (/3 of the electron are taken from the parameters 
of the matched track; its energy is determined from the 
calorimeter cluster. The isolation criterion /iso requires 
the ratio of the difference of the total energy within the 
cone size TZ < 0.4 around the center of the cluster and 
the energy deposited in electromagnetic layers within the 
cone size TZ < 0.2 to the reconstructed electron energy 
not to exceed 15%. 

The electron shower shape estimator is built from seven 
observables characterizing the electron shower shapes, 
which are the energy deposits in the first five layers of 
the calorimeter, the azimuthal extension of the cluster in 
the finely segmented third layer of the calorimeter, and 
the logarithm of the cluster total energy. From these 
observables a covariance matrix is built, where the ma- 
trix elements are computed from reference Monte Carlo 
samples at different cluster energies and pseudorapidities. 
The covariance parameter Xh measures the consistency 
of a given shower to be an electromagnetic one. As the 
observables are not normally distributed, Xh does not 
follow a normal distribution and a cut on x% < 50 is 
applied for electrons. 

To define a "tight" electron we combine in a likelihood 
discriminant the variables defined above (/em, Xh) with 
(i) the ratio of the transverse component of the clus- 
ter energy measured in the calorimeter to the transverse 
momentum of the matched track, E^^/p^'^'^^, (ii) the x^ 
probability of a track matched to the calorimeter clus- 
ter, (Hi) the dca of the matched track with respect to 
the primary vertex, (iv) the number of tracks within a 
cone oi TZ = 0.05 around the matched track and (v) the 
sum of transverse momenta of the tracks inside a cone 
of 7?. < 0.4 around, but excluding the candidate track. 
By construction, a discriminant value close to unity cor- 
responds to a prompt isolated electron. Wc require that 
tight electrons satisfy the loose criteria and have a likeli- 
hood discriminant £cm > 0.85. 

The electron energy scale is fixed by comparing the 
di-electron invariant mass distribution in Z — > ee events 
selected from the data with the simulated expectation 
based on a Z boson mass of 91.19 GeV [l3]- Additional 
random smearing of the electron inverse energy is applied 
to tune the simulated electron energy resolution to that 
observed in the data. 



C. Muons 

Muons are identified from tracks reconstructed in the 
layers of the muon system and matched to a track recon- 
structed in the central tracking system taking advantage 
of its superior momentum and position resolution. For 
this analysis, we accept muons having (?) at least two 
wire hits and at least one scintillator hit in both the 
A-layer inside the toroid and the B- and C-layers out- 
side, (ii) three matched reconstructed muon track seg- 
ments from all three muon system tracking layers, [Hi) 
a good quality matched track in the central tracking sys- 
tem (x^/-^dof < 4) and (iv) consistency with originating 
from the primary interaction vertex. The last condition 
includes the requirements that the timing of the muon, 
determined from associated scintillator hits, has to be 
within 10 ns of the beam interaction time, that the small- 
est distance along z axis between the primary vertex and 
the muon track must be less than 1 cm and S'dca < 3. 

Muons arc distinguished as "loose" and "tight" de- 
pending on their isolation with respect to other recon- 
structed objects in the event. The loose muon isola- 
tion criterion is defined by demanding that a muon is 
separated from a jet by A7?,(/i,jet) > 0.5 where ATZ 
is the distance in pseudorapidity-azimuthal angle space. 
For a tight muon identification, the muon is addition- 
ally required to be isolated from energy depositions in 
the calorimeter and additional tracks in the tracking sys- 
tem. The calorimeter isolation requires the sum of the 
calorimeter cells' transverse energies between two cones 
of radius TZ = 0.1 and TZ = 0.4 around the muon track to 
be smaller than 8% of the muon pT- The track isolation 
is based on the sum of the tracks' momenta contained in 
a cone of TZ = 0.5 around the muon track, excluding the 
muon track itself. Wc require the sum to be less than 6% 
of the muon pT- 

The muon momentum is measured from the matched 
reconstructed central track. Due to the limited accep- 
tance of the SMT some tracks have hits in the CFT part 
of the central tracking system only, and therefore their 
resolution is degraded. To improve the momentum res- 
olution of such tracks we apply a correction to the in- 
verse track transverse momentum. It is based on a fit 
constraining the track dca to zero with respect to the 
primary vertex in the transverse plane. 

The muon momentum scale is fixed by comparing the 
di-muon invariant mass distribution in Z — > /i^ events 
selected from the data with the simulated expectation 
based on the Z boson mass. Additional random smearing 
of the muon inverse transverse momenta is performed to 
tune the simulated muon momentum resolution to that 
observed in the data. 



D. Jets 

Jets are reconstructed from calorimeter cells using the 
iterative, seed-based cone algorithm including midpoints 
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[Hi with a cone radius of 7^jet = A/(Ay)2 + {Aip)^ = 0.5. 
The minimum pT of a reconstructed jet is required to be 
8 GeV before any energy corrections are applied. To 
remove jets resulting from noise in the calorimeter or 
created by electromagnetic particles, further quality cri- 
teria are applied: (j) the jet has to have between 5% and 
95% of its reconstructed energy in the electromagnetic 
calorimeter and less than 40% of its energy in the out- 
ermost hadronic section of the calorimeter, (ii) the ratio 
of the highest to the next-to-highest transverse momen- 
tum cell in a jet has to be less than 10, {Hi) a single 
calorimeter tower must not contain more than 90% of 
the jet energy and (iv) the jet has to be confirmed by 
the independent calorimeter trigger readout. 

Previously reconstructed electrons and photons might 
also be reconstructed as jets in the calorimeter. To avoid 
the resulting double-counting, we reject any jet over- 
lapping with an electromagnetic object within a cone 
of 7?. < 0.5 fulfilling the electron identification criteria 
of Sect. IIIIBI and having pT > 15 GeV and 
|?7dot| < 2.5. 

We correct the pr of each reconstructed jet to the par- 
ticle level by applying jet energy scale (JES) corrections 
[l9| . These corrections account for imperfect calorime- 
ter response, the jet energy offset due to the underly- 
ing event, multiple interactions, pile-up effects and noise, 
and the jet energy loss due to showering outside of the 
fixed-size jet cone. We make use of transverse momentum 
conservation in a sample of photon -I- jet events to cal- 
ibrate the jet energy and determine the jet energy scale 
corrections separately for data and simulation. Since the 
jet identification efficiency and energy resolution differ 
between data and simulation, the jet inverse energies are 
smeared and depending on the jet jrydetl from 1% to 3% 
of the jets are removed to reproduce the data. 



E. Missing Et 

The presence of a neutrino in the final state can be 
inferred from the energy imbalance of an event in the 
transverse plane. It is reconstructed from the vector 
sum of the transverse energies of all cells surviving var- 
ious noise suppression algorithms and not belonging to 
a coarse hadronic section of the calorimeter. The latter 
cells are generally excluded due to their higher noise level. 
They are however included if clustered within jets. The 
vector opposite to this total visible momentum vector is 
referred to as raw missing transverse energy vector. 

The calorimeter response to electromagnetic particles 
such as photons, electrons or tt^s is different from that 
due to hadrons and in particular from that due to jets. 
In events with both electromagnetic objects and jets, this 
imbalance propagates directly into missing transverse en- 
ergy (^t)- As a JES correction is derived for all jets 
satisfying criteria {i)-{iv) of Sect. IIIIDl it also has to 
be applied to ^t- In order to do so, the JES correction 
(limited to the response part) applied to jets is subtracted 



from the vector. In an equivalent way the EM cor- 
rection for electromagnetic objects is applied to the 
vector. 

Muons are minimum ionizing particles throughout the 
entire detector. Hence they will deposit only a small 
amount of energy in the calorimeter and their presence 
can thus fake missing transverse energy. Therefore we re- 
place the transverse energy deposited by muons, satisfy- 
ing requirements {i)-{in) of Sect. HITCl in the calorime- 
ter by the transverse momentum measured by the track- 
ing system. 



IV. DATA SAMPLES AND MONTE CARLO 
SIMULATION 

A. Event trigger 

The DO trigger is based on a three-level pipeline sys- 
tem. The first level consists of hardware and firmware 
components that make a trigger decision based on fast 
signal inputs from the luminosity monitor, the tracking 
system, the calorimeter and the muon system. The sec- 
ond level combines the same information to construct 
simple physics objects, whereas the third level is software 
based and uses the full event information obtained with 
a simplified reconstruction. The accepted event rates are 
2 kHz, 1 kHz and 50 Hz respectively for level 1 (LI), 
level 2 (L2) and level 3 (L3). For all events used in this 
analysis the trigger system is required to find at least one 
jet and an electron or muon. 

The DO calorimeter trigger is based on energy de- 
posited in towers of calorimeter cells with a transverse 
granularity of A77 x A(y5 = 0.2 x 0.2. In addition, towers 
are segmented longitudinally into electromagnetic (EM) 
and hadronic (HAD) sections. The level 1 electron trigger 
requires a minimum transverse energy {Et) deposition in 
the electromagnetic section of a tower. At level 2, a seed- 
based cluster algorithm sums the energy in neighboring 
towers and bases the trigger decision on the Et and the 
electromagnetic fraction (/em) of a cluster. At level 3, 
the electron identification is based on a simple cone al- 
gorithm with TZ < 0.25 and the trigger decision is based 
on the requirements on Et, /em and a shower-shape es- 
timator. 

The level 1 jet trigger is based on the Et deposited in 
a full calorimeter trigger tower. At level 2, these towers 
are summed by a seed based cluster algorithm within a 
5x5 tower array. The level 3 jet algorithm uses a simple 
cone algorithm with TZ < 0.5 or 7?. < 0.7 and a decision 
is taken based on the Et within the cone. 

The level 1 muon trigger is based on input from the 
muon scintillator counters, the muon wire chambers and 
the track trigger system. At level 2, muons are recon- 
structed from the muon scintillator and wire chamber 
information and requirements on the number of muons, 
their transverse momentum pT and position in rj as well 
as on their quality can be made. The quality is based 
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on the number of scintillators and wires hit. At level 3, 
muon tracks are fitted using information from the track- 
ing and muon systems. This refines the selection in pT, 
77, and reconstruction quality. 

The data used for the measurement presented in this 
paper were collected between August 2002 and August 
2004 and correspond to an integrated luminosity of 422 
± 26 pb^^ in the /x+jcts and 425 ± 26 pb^^ in the e+jets 
channel, respectively [1^1 . The trigger criteria evolved 
over this period of time to account for the increase in in- 
stantaneous luminosity while keeping a constant trigger 
rate. The different trigger criteria and the corresponding 
integrated luminosity collected are summarized in Ta- 
ble [1] for the e-|-jets and the /n-l-jets data. 



B. Trigger efficiency 

Only a fraction of all produced ti events will pass the 
selection criteria imposed by the trigger system. The 
trigger efficiency for tt events is estimated by folding into 
simulated events the per-lepton and per-jet probability 
to satisfy the individual trigger conditions at LI, L2 and 
L3. The total probability for an event to satisfy a set of 
trigger requirements is obtained assuming that the prob- 
ability for a single object, described below, to satisfy a 
specific trigger condition is independent of the presence 
of other objects in the event. Under this assumption, the 
contributions from the lepton and the jets to the total 
event probability factorize, so that 

-Pcvcnt — -^lepton ^ ^ct- (1) 

Furthermore, under the assumption of independent trig- 
ger objects, the probability Pjct for at least one out of 
iVjet jets in the event to fulfill the jet part of the trigger 
requirement is given by 

^jct = 1-11(1-^')' (2) 

where Pi is the probability for one jet to pass the trigger 
conditions. 

The total trigger efficiency is then calculated as the 
luminosity-weighted average of the event probability as- 
sociated to the trigger requirements corresponding to 
each data taking period. 



C. Trigger efficiency measurement 

The probability for a lepton or a jet to satisfy a partic- 
ular trigger requirement is measured in samples of events 
that are unbiased with respect to the trigger requirement 
under study. Reconstructed leptons or jets are identified 
in the event offline and the trigger cfflciency is deter- 
mined by measuring the fraction of objects satisfying the 



trigger condition under study. These efficiencies are gen- 
erally parameterized as a function of the object px and 

f?dct- 

We use a sample of Z ^ e+e" {Z ^ (X^ events to 
calculate the fraction of electrons (muons) , fulfilling the 
requirements defined in Sect. IIIIBI and IIII C[ that pass 
the trigger requirement under study. We selected events 
triggered by a single electron (muon) trigger and require 
the presence of two reconstructed electrons (muons) ful- 
filling the tight selection criteria defined in Sect. IIII Bl 
pil Cp for electrons (muons), respectively. The invariant 
mass of the two selected leptons is required to be within 
a window around the Z mass, 80GeV< Mu <100GeV. 
We choose one electron (muon) as a "tag" and require 
it to have pt above 20 GeV and to be matched to an 
electron (muon) object at all relevant trigger levels. We 
use the other "probe" electron (muon) to calculate the 
efficiency of the trigger criterion studied. If both leptons 
fulfill the tag requirements, each of them serves both as 
a tag and as a probe. 

Figure [1] shows the measured probability that the elec- 
tron passes the L3 condition and the parameterization 
used in the analysis for the last data-taking period. Fig- 
ure 121 shows the measured muon trigger efficiencies for 
the first and second data-taking periods. The measured 
efficiency is parameterized as a function of the muon r/dct 
with the fit function chosen to be symmetric in 77dct- Both 
the muon detector geometry and the details of offline 
reconstruction contribute to the observed shape of the 
distribution. We do not use the dependence of the trig- 
ger efficiency on the muon transverse momentum in the 
parameterization. However, due to the spread of efficien- 
cies observed, an overall uncertainty of ±2 % is added in 
quadrature to the statistical fit uncertainty. 
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FIG. 1: Electron L3 trigger efficiency for the last data-taking 
period and its parameterization as a function of the electron 
Et. 
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e + jets channel 



Trigger Name 


/ Cdt 

(pb-i) 


Level 1 


Level 2 


Level 3 


EM15_2JT15 


128 


1 EM tower, Et > 10 GeV 
2 jet towers, pT > 5 GeV 


le, Et > 10 GeV, /em > 0.85 
2 jets, Et > 10 GeV 


1 tight e, Et > 15 GeV 
2 jets, PT > 15 GeV 


E1_SHT15.2J20 


244 


1 EM tower, Et > 11 GeV 


None 


1 tight e, St > 15 GeV 
2 jets, Pt > 20 GeV 


E1_SHT15.2J_J25 


53 


1 EM tower, Et > 11 GeV 


1 EM cluster, Et > 15 GeV 


1 tight e, St > 15 GeV 
2 jets, Pt > 20 GeV 
1 jet, Pt > 25 GeV 


fi + jets channel 


Trigger Name 


/ Cdt 
(pb-i) 


Level 1 


Level 2 


Level 3 


MU.JT20.L2M0 


132 


1/j, \ri\ < 2.0 
1 jet tower, pt > 5 GeV 


1/t, |r;| < 2.0 


1 jet, PT > 20 GeV 


MU.JT25.L2M0 


244 


lAt, \ri\ < 2.0 
1 jet tower, pT > 3 GeV 


Ifi, \ri\ < 2.0 
1 jet, PT > 10 GeV 


1 jet, PT > 25 GeV 


MUJ2.JT25 


30 


1^, |r?| < 2.0 
1 jet tower, pT > 5 GeV 


l/x, |r)| < 2.0 
1 jet, pT > S GeV 


1 jet, PT > 25 GeV 


MUJ2.JT25.LM3 


16 


1^, |r?| < 2.0 
1 jet tower, pT > 5 GeV 


l/x, |r;| < 2.0 
1 jet, PT > S GeV 


Ip., \r)\ < 2.0 
1 jet, PT > 25 GeV 



TABLE 1: Trigger requirements for different data-taking periods. 



We measure jet trigger efficierLcies in a sample of data 
events which fire one of tlie many muon triggers present 
in a set of triggers corresponding to a data-taking pe- 
riod of interest. The jet trigger efficiencies are param- 
eterized as a function of jet px in three regions of the 
calorimeter: CC (|77dct| < 0.8), ICD (0.8 < |7?dct| < 1-5) 
and EC (|77dct| > 1.5). An example of the parameteriza- 
tions obtained for the second data-taking period is shown 
in Fig. [31 Systematic uncertainties associated with the 
method are evaluated by varying the jet sample selec- 
tion. The difference between the efficiencies derived in 
different samples is added in quadrature to the statisti- 
cal uncertainty of the fits. 



D. Monte Carlo simulation 

We use Monte Carlo simulated samples to calculate 
selection efficiencies and to simulate kinematic charac- 
teristics of the events. Top quark signal and M^-|-jets 
and Z-l-jets background processes are generated at ^/s = 
1.96 TeV using ALPGEN 1.3.3 [l^l for the appropriate 
matrix element simulation and PYTHIA 6.202 1211 for 
subsequent hadronization. We used the CTEQSl [22] par- 
ton distribution functions for modeling the initial and fi- 
nal state radiation, decays and hadronization in pythia. 
The "tune A" [2^ parameter set is used for simulating 
the underlying event. Minimum bias simulated proton- 
antiproton events are superposed on all simulated events 
after hadronization. 

In the tt signal simulation we set the top quark mass to 
175 GeV and choose the factorization scale for calculation 



of the tt process to be Q = m^. We use evtgen [24 | 
to provide the branching fractions and lifetimes for all b 
and c hadrons. The main background consists of M^-|-jets 
and is simulated at the factorization scale + J2Pt 
where Mw is the W boson mass and pTj is the transverse 



momentum of the jet j in the event. For Z-fjets events 
the scale is set to the squared invariant mass of the lepton 
pair Mfg. We include virtual photon process (Drell-Yan 
production) and the interference between the photon and 
Z boson in the model. 

Generated events are processed through the GEANT- 
based [l^ simulation of the DO detector and are recon- 
structed with the same program as used for collider data. 



E. Calibration of Monte Carlo simulations 

We smear (i.e., convolute with a Gaussian) the recon- 
structed inverse energies of electrons and inverse trans- 
verse momenta of muons and jets in the simulation to 
improve the agreement with the observed momentum 
resolutions in data, as already described in Sect. IIII Bf - 
ImPl In addition, we correct the simulation for possi- 
ble inaccuracies in describing individual object identifica- 
tion efficiencies. We derive correction factors to account 
for the difference in the following efficiencies between 
data and the simulation: (i) electron (muon) reconstruc- 
tion and identification, (ii) electron (muon) track match, 
(iii) electron likelihood, (iv) muon isolation, (v) muon 
track quality and the distance of closest approach sig- 
nificance (requirements iii and iv of Sect. IIII C[ respec- 
tively), (vi) primary vertex selection, and (vii) electron 
(muon) promptness by comparing the efficiencies mea- 
sured in Z ^ £^£~ data events to the ones obtained from 
the simulation. Two typical examples of the methods 
used to determine correction factors and their system- 
atic uncertainties are provided below. 

To measure the efficiency of electron (muon) recon- 
struction, we use the same tag and probe method as that 
used in the trigger efficiency calculation. To avoid bias 
due to trigger requirements events used for the measure- 
ment have to be recorded with a single electron (muon) 
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FIG. 2: Muon trigger efficiencies for the first and second data- 
taking periods. The parameterization as a function of the 
muon T^det is shown in the upper plot as the dashed hue, the 
statistical error of the fit added in quadrature with the sys- 
tematic uncertainty is given by the band. The lower plot 
shows the muon trigger efficiency as a function of the muon 
Pt and the chosen central value along with the uncertainty 
band. 



trigger, and we require a tag electron (muon) be matched 
to the electron (muon) trigger object at all trigger lev- 
els. We repeat the same measurement using simulated 
Z £~^i~ events and plot the ratios of the efhciencies 
as a function of detector 77, cj) and pr for muons and, 
additionally, as a function of the distance to the closest 
jet in the event for electrons to probe the dependence of 
the electron reconstruction on the jet activity. Since no 
strong dependence on any of these quantities is found, we 
use inclusive factors to correct simulation to data yield- 
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FIG. 3: The trigger efficiency for a jet to pass LI, L2 and 
L3 trigger requirements for the three different calorimeter re- 
gions: CC (top), ICD (middle) and EC (bottom). 



ing 0.98±0.027 (syst) (1.00±0.04 (syst)) for the electron 
(muon) reconstruction and identification efficiency. Sys- 
tematic uncertainties are assigned based on the spread 
of measured ratios in the A7?.(e, jet) distribution for elec- 
trons and |?7dct| for muons. 

We determine the efficiency of finding a track matched 
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to a electron (muon) by applying the same tag and probe 
method to Z ^ £~^£~ events selected with a tight electron 
(muon) as a tag and an electromagnetic cluster satisfying 
criteria of Sect. IIIIBI (a muon identified in the 

muon chambers) as probe for electrons (muons). The cor- 
rection factors obtained by comparing efficiencies in data 
and the simulation are found to be 0.983 ± 0.007 (syst) 
for electrons and 0.99±0.03 (syst) for muons. Systematic 
uncertainties arise mainly from the minor dependence of 
the correction factors on the pt, and (p of the leptons. 
An example of such a dependence is shown in Fig. |4] for 
the electron track match efficiency. 



V. METHOD OVERVIEW 

The analysis strategy is outlined briefly in the follow- 
ing. First, we select events that have the same signature 
as tt signal events decaying in the lepton-|-jets channel, 
i.e., a truly isolated Icpton and genuine from the W 
boson decay. Multijet events produced by strong inter- 
actions are expected to contain neither isolated leptons 



nor ^T- However, they are present in the selected sam- 
ples due to the imperfect reconstruction in the detector. 
In particular, the selected e-|-jets sample contains contri- 
butions from multijet events in which a jet is misiden- 
tified as an electron. Events where a muon originating 
from the semileptonic decay of a heavy quark appears 
isolated contribute to the selected sample in the /i-|-jets 
channel. Significant can arise from fluctuations and 
mismeasurcments of the muon and jet energies. In order 
to model these effects, we use a dedicated data sample to 
describe the kinematic properties of the surviving multi- 
jet events. 

The background within the selected samples is dom- 
inated by W-l-jets events. Its contribution is estimated 
using Monte Carlo simulations. We validate the back- 
ground model by comparing observed distributions to 
the predictions from our model in samples of events with 
low jet multiplicities where only a small signal fraction 
is expected. For these comparisons we assume a tt pro- 
duction cross section of 7pb as predicted in the SM. The 
Kolmogorov-Smirnov probability for data and simulation 
to originate from the same underlying distribution is used 
as an estimator of the quality of the background model 
and generally good agreement is found. 

To extract the fraction of tt events in the samples, we 
select kinematic variables which discriminate between the 
iy-|-jets background and the tt signal, and combine them 
into a discriminant function. The selected variables are 
required to be well described by the background model. 

In a final step, we derive the discriminant function for 
the observed data, the tt signal and the electrowcak and 
multijet backgrounds. A Poisson maximum-likelihood fit 
of the signal and background discriminant distributions 
to that of the data yields the fraction of ti signal and 
the electroweak and QCD multijet backgrounds in the 
data sample. Finally, the ti production cross section is 
computed from the number of fitted tt events. 

In contrast to the tt cross section measurement pre- 
sented in Ref. [1^ , we do not take advantage of the fact 
that two jets are expected to contain displaced vertices 
due to the &-quark decays for signal events. Our cross 
section estimation is based solely on the different kine- 
matic properties of the signal and background events. 



VI. EVENT SELECTION 

In both channels, we select events containing one Icp- 
ton with Pt > 20 GeV that passes the tight iden- 
tification criteria, originates from the primary vertex 
(|Az(€, PV)| < 1cm), and is matched to trigger objects 
at all relevant levels. We accept muons with |?7dct| < 2.0 
and electrons with |77dct| < l-l- This choice of cuts is 
motivated by the acceptance of the DO muon system and 
central calorimeter, respectively. Jets in the event are re- 
quired to have \t]\ < 2.5 and px > 20 GeV except for the 
highest pt jet which has to fulfill pT > 40 GeV. Events 
with a second isolated high transverse momentum Icpton 
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are studied elsewhere [27| and explicitly vetoed in the 
event selection to retain orthogonality between analyses. 

In both channels we require > 20 GeV to reject 
multijct backgrounds. However, a significant fraction of 
multijct events survive this cut due to the presence of 
heavy flavor decays or jet energy mismeasurement. These 
events typically have either in the direction of the 
lepton or back-to-back to it. Figure [5] illustrates the dif- 
ference in the angular distribution of and the lep- 
ton {Aip{£,^t)) between signal and multijet background 
events which we exploit to further suppress the latter. 

We performed a grid search in the (^t, A(p(^,^7-)) 
plane to find cuts that provide the highest product of 
efhciency and purity for tt events, where purity is de- 
fined as the ratio of the number of signal tt events to the 
total number of events in the selected sample. The op- 
timal cuts are found to be A(p{e,^T) > O.Ttt — 0.045^t 
and Atp{pL,$T) > 0.67r(l -^T/(50GeV)) in e-^jets and 
/i-|~jcts channels respectively in addition to the common 
> 20 GeV cut. 

The /i-|-jets channel suffers from a significant contribu- 
tion of Z(/x/i)-|-jets events which pass the selection cri- 
teria due to poor resolution in events with four or 
more jets. A cut on the invariant dimuon mass of the 
selected isolated high pt muon and the additional high- 
est pt muon with relaxed quality requirements is applied 
at 70 GeV < M^^ < 110 GeV and rejects roughly 27% 
of the Z /i/i+jet background while keeping almost 
100 % of the signal in the selected sample. The remain- 
ing Z /i/i-|-jcts background cannot be rejected since no 
second muon is reconstructed mainly for reasons of finite 
acceptance. 

The tt event selection efficiency is measured using sim- 
ulated events with respect to all ti final states that con- 
tain an electron or a muon originating either directly 
from a W boson or indirectly from the W ^ tv de- 
cay. The branching fractions of such final states are 
17.106% and 17.036% [ij for the e-hjets and n+jets 
channels, respectively. After applying the correction fac- 
tors discussed in Sect. IIV El and the trigger efficiency pa- 
rameterizations (Sect. IIVB[) to the simulated tt events, 
the final selection efficiencies yield (9.17±0.09)% and 
(9.18±0.10)% in the e-|-jets and /x-f-jets channel, respec- 
tively. The quoted uncertainties are statistical only. 



ments of the jet energies and the muon momentum in ad- 
dition to neutrinos originating from semi-muonic heavy 
quark decays. These instrumental backgrounds are col- 
lectively called "multijet backgrounds" , and their contri- 
bution is estimated directly from data since Monte Carlo 
simulations do not describe them reliably. 

In order to estimate the contribution of the multijet 
background to the selected data samples we define two 
samples of events in each channel, a "loose" and a "tight" 
set where the latter is a subset of the former. The loose 
set (containing events) corresponds to the selected 
sample described in the previous paragraph, but with 
only the loose lepton requirement applied. The tight 
sample (containing Nt events) additionally demands the 
selected lepton to pass the tight criteria and is identi- 
cal to the selected sample. The loose sample consists of 
N'^ events with a truly isolated lepton originating from 
P^-l-jets, Z-l-jets or events and N'' multijet background 
events with a fake isolated lepton: = N'^ + N^. The 
tight sample consists of e^iV* tt signal and electroweak 
background events and EbN^ multijct background events, 
where Eg and Sb arc the efficiencies for a loose lepton to 
also fulfill the tight lepton requirements. 

Solving the system of linear equations for N'^ and N'^ 
yields: 



iV-=^--^^^^ and iV^ = £f^!l^, (3) 



£b 



£b 



and allows the determination of the size of the multijet 
background contribution in the selected sample. As for 
the shape of the multijet background, for a given vari- 
able it is predicted using a data sample where the full 
selection has been applied except for the tight lepton re- 
quirement. Instead, the requirements on the muon iso- 
lation in the //-|-jets channel and electron likelihood in 
the e+jets channel arc inverted, selecting a data sample 
enriched in events originating from multijet production 
processes ( "loose— tight" data sample). However, truly 
isolated leptons from tt and W/Z+^ets events will leak 
into this sample. The composition of the "loose— tight" 
{Ni-t = Ni — Nt) data sample can be derived from Eq.[3l 



VII. BACKGROUNDS 



1 



-iV/ -t- N, 



(4) 



A. Multijet background evaluation 

The background within the selected samples is domi- 
nated by VF-|-jets events, which have the same final state 
signature as tt signal events. However, the samples also 
include contributions from multijet events in which a 
jet is misidentified as an electron (e-|-jets channel) or in 
which a muon originating from the semileptonic decay of 
a heavy quark appears isolated (/x-|-jets channel). Sig- 
nificant can arise from fluctuations and mismeasure- 



where = EgN^ is the number of preselected tt and 
electroweak background events as estimated in the fol- 
lowing section and N^_i is the pure multijet contribution 
to the "loose— tight" preselected sample. Using Eq. 21 
the contaminations from ti and electroweak backgrounds 
are subtracted bin-by-bin from the distribution of the 
"loose— tight" preselected data sample in order to pre- 
dict the shape of the pure multijet contribution for each 
individual discriminant input variable under considera- 
tion. 
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FIG. 5: A^(e,_^T) versus fir in the multijet QCD data sample, tt Monte Carlo, PF+jets Monte Carlo and in data. The lines 
represent the cuts optimized for the fourth inclusive jet multiplicity bin. 



1. Sb determination 



The rate Sb at which a lepton in multijet events appears 
isolated is measured in a data sample which passes the 
same requirements as the selected one but without apply- 
ing the -^T-related set of cuts discussed in Sect. I VII In 
this data sample, we calculate et, the ratio of the number 
of tight events to the number of loose events, as a func- 
tion of ^T- We find that it is constant for < 10 GeV, 
shown in Fig. [6] (top) for the /x-|-jets channel, as expected 
for a sample dominated by the multijet events. The value 
of Eb given by the constant fit to data in < 10 GeV 
region is used in the analysis. 



In the muon channel, et does not show significant de- 
pendence on the jet multiplicity and does not change 
between different data-taking periods. However, rather 
strong dependences are observed with respect to the 
muon 7]dot (Fig. [6] middle) and transverse momentum 
(Fig. [5] bottom) . We estimated the effect of these de- 
pendences on the inclusive Sb by folding them in with 
the muon T^det and pr spectra of the selected sample. 
Since the small number of events with four or more re- 
constructed jets in the low sample does not allow for 
a precise measurement we determine from the events 
with three or more jets and assign systematic uncertainty 
from the difference between the flat and the muon px- 
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17.8 ± 2.0 (stat) ± 3.1 (syst)% 



(5) 



In the electron channel we find no significant depen- 
dence of Eh on the jet multiplicity and electron Ty^ct ^^nd 
Pt- However, we observe a statistically significant vari- 
ation of Eb between different data-taking periods. In 
particular, we find a higher value of £& for events col- 
lected during the second data-taking period than during 
the first one. We attribute this increase to the more 
stringent electron shower shape requirements applied at 
trigger level 3, which improves the quality of the fake 
electrons that enter our loose sample, making them more 
likely to pass the tight criterion. Figure [7] shows the elec- 
tron isolation efficiency as a function of for events 
with two or more jets, obtained separately for data col- 
lected during three data-taking periods. A fit to these 
distributions in the region of < 10 GeV yields corre- 
sponding Eb- In Eq. [4]we use a luminosity- weighted av- 
erage Eb obtained by analyzing events with two or more 
jets: 



Eb = 16.0 ± 1.2 (stat) ± 8.0 (syst)% . 



(6) 



The systematic uncertainty of Eb arises from the small 
observed variation as a function of jet multiplicity and 
electron pt- 



2. Es determination 

The probability Es that a truly isolated lepton (i.e., a 
lepton originating from W boson decays) from a loose 
sample will survive the tight isolation requirements is 
measured using simulated VF-fjets events with four or 
more jets and corrected with the simulation-to-data scale 
factor independent of jet multiplicity. 

In the muon channel, e^ depends on the muon px spec- 
trum, shown in Fig. [51 and hence is slightly different for 
M^-|-jets and tt events. In the signal jet multiplicity bin 
(Njot > 4) , we add the fraction of ti events corresponding 
to the expected tt cross section of 7 pb and obtain: 



81.8 ±0.7 (stat) tf l (syst)%. 



(7) 



where the systematic uncertainty is derived by varying 
the ti fraction between and 100 %. In the electron chan- 
nel, Es determined from the simulated ti events agrees 
with the one obtained from M^+>4 jets events and yields: 



Es = 82.0 ± 0.7 (stat) ± 1.3 (syst)% . 



(8) 



Systematic uncertainty arises from the uncertainty on the 
simulation-to-data correction factor. 



B. Expected sample composition 



FIG. 6: Tight muon isolation efficiency et measured in the 
QCD multijet background dominated data sample as a func- 
tion of (top), the muon rjdct (middle) and pr (bottom) in 
/i-f jets channel. 



Equation [3] is applied separately to events selected 
in bins of jet multiplicity for both the e-|-jets and 
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FIG. 7: Electron likelihood efficiency in the QCD multijet background dominated data sample as a function of for events 
with two or more jets, for the first (left), second (middle) and third (right) data-taking periods. The constant fit to the region 
of < 10 GeV is used to determine the value of et, used in the analysis. 
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FIG. 8: Es as a function of muon pr for simulated H^+jets (tri- 
angles) and tt (squares) events. 



fi+jets channels. The yields of multijet events Nj^ and 
events with a real isolated lepton A^* in the selected sam- 
ple with four or more jets are summarized in Table |2l 
Several physics processes contribute to signal-like events 
in the selected sample: ti pair production decaying 
into the ^-|-jets final state, tt pair production decaying 
into two leptons and jets, ti — > Wviv^bb, where both W 
bosons decay leptonically, and electroweak background 
with contributions both from M^-l-jets and Z+]ets events. 

We estimate the amount of Z-|-jets background rela- 
tive to the M^-|-jets background using the cross sections, 



branching fractions and selection efficiencies determined 
using simulated events for both processes: 



Z-l-jets 



B 



Z^t^fi gZ+jcts 

X X IMw+jets ■ 



CVF-l-jots ■ Bw^i^iv £VF+jots 



In the /.t+jets channel, the ratio of the Z-|-jets contribu- 
tion to the total electroweak background in the selected 
sample is measured to be 7%. In the e-|-jets channel, the 
Z+jets background is negligible. 

The expected contribution from the tt dilepton chan- 
nel is evaluated assuming a standard model cross section 
of 7 pb for ti pair production. The fully corrected effi- 
ciencies to select ti dilepton events are found to be 0.6% 
and 0.5% in the e-|-jets and /i-|-jets channel, respectively. 
This results in a 2.0% (2.3%) contribution of dilepton 
events into the /^+jets (e-|-jets) final state. 



VIII. KINEMATICAL ANALYSIS 

The background within the selected samples is domi- 
nated by W-|-jets events, which have the same final state 
as ti signal events. To extract the fraction of ti events 
in the sample we construct a discriminant function that 
exploits the differences between the kinematic proper- 
tics of the two classes of events: ti signal and W-|-jets 



channel 


es{%) 


£(,(%) 


Ni 


Nt 


m 


m 


/i-|-jets 
e-f jets 


81.8±0.7 
82.0±0.7 


17.8±2.0 
16.0±1.2 


160 
242 


100 
119 


8.6±2.0 
19.2±2.3 


91.4±10.7 
99.8±11.6 



TABLE 2: Selected sample composition determined using 
Eq|31 Only the statistical uncertainties are quoted where ap- 
propriate. 
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background. All other backgrounds are small and do not 
justify the introduction of an additional event class. 



A. Discriminant function 

The discriminant function is built using the method 
described in Ref. [1^, and has the following general form: 



S{xi,X2, ■ 



S{xi,X2, ...) + B{xi,X2, ...) 

where xi, X2, ... is a set of input variables and S{xi^ X2, ■■■) 
and B(xi,X2,---) are the probability density func- 
tions (pdf) for observing a particular set of values 
{xi,X2, ■■■,Xn) assuming that the event belongs to the 
signal or background, respectively. Neglecting correla- 
tions between the input variables, the discriminant func- 
tion can be approximated by: 



V = 



U^S^{x^)/b^{x^) 

Hi Si{xi)/bi{xi) + 1 ' 



(10) 



where Si{xi) and hi[xi) are the normalized pdf's of each 
individual variable i for tt signal and W-l-jets back- 
ground, respectively. In the analysis, we express the dis- 
criminant as 



V 



exp^EJlnfffT^' 
exp(E.(ln|fe}))+l 
exp(E.(ln|{fy)fittod 



(11) 



b(xi) /fitted 



1 



where {\^^)\ttcd ^ 1"^^ ^^^'^ logarithm of the ratio of 
the signal and background pdfs for each kinematic vari- 
able i. The application of a fit to the logarithm of the 
signal to background pdf ratios reduces the influence of 
individual events on the discriminant output. 



B. Selection of discriminating variables 

All possible observablcs with different pdfs for M^-l-jets 
and tt events have the ability to discriminate between the 
two. As a first step toward the goal of selecting an opti- 
mal set of discriminating input variables, we first evaluate 
the separation power for a large set of individual vari- 
ables by estimating the expected total uncertainty of the 
ti cross section when using the variable under considera- 
tion as sole discriminator. Variables are then ranked and 
selected by increasing uncertainty. The total expected 
uncertainty is estimated by adding the systematic un- 
certainties related to jet energy scale (JES), jet energy 
resolution (JER) and jet reconstruction efficiency (JID) 
in quadrature to the statistical uncertainty: 



CTtot 




JES 



'JER 



'JID- 



(12) 



The optimization is done in e-f jets and /i-|-jets channels 
separately. Therefore the statistical uncertainty in Eqfl2l 
is reduced by a factor of 1/ \/2, since the additional data 
from the complementary channel roughly doubles the 
statistics in the combination. We select a set of thir- 
teen variables described in Appendix as input for the 
second step of the discriminant function optimization. 



C. Optimization of the discriminant function 

The optimization procedure, determining which com- 
bination of topological input variables will form the fi- 
nal discriminant function, is performed by estimating 
the expected combined statistical and systematic uncer- 
tainty of the measured ti cross section. The expected 
uncertainty is calculated for all discrimination functions 
that can be constructed from the selected input variables 
by using all possible subsets of the 13 variables in turn 
as input. Pseudo-experiments are performed by draw- 
ing pseudo-data discriminant output distributions from 
the output discriminant distributions of simulated events. 
The composition of such a pseudo-dataset is taken ac- 
cording to the expected sample composition for iVjot > 4 
and Utt — 7pb and allowing Poisson fluctuations. 3,000 
pseudo-experiments are built for each source of statisti- 
cal or systematic uncertainty and discriminant function 
under consideration. We select the discriminant function 
which provides the smallest expected total uncertainty, 
including all sources of systematic uncertainties that af- 
fect the shape of the discriminant function. 

In the ^-|-jets channel the discriminant constructed 
with the following five input variables shows the best 
performance: (i) Ht, the scalar sum of the px of the 
four leading jets; {ii) ^rf, the sum of the squared pseu- 
dorapidities of the four leading jets; [iii) Mt, transverse 
mass of the four leading jets; {iv) the event centrality C, 
defined as the ratio of the scalar sum of the px of the jets 
to the scalar sum of the energy of the jets; (u) the event 
aplanarity A, constructed from the four-momenta of the 
Icpton and the jets. Aplanarity characterizes the event 
shape and is defined, for example, in Ref. [29J. 

In the e-|-jets channel the optimal discriminant func- 
tion is found to be built from six variables: (i) NJW, the 
weighted number of jets in the event; [ii) the event cen- 
trality C; {Hi) the event aplanarity^; {iv) |?7jet|™'*'', \r]\ of 
the jet with maximum pseudorapidity; {v) minimum of 
the invariant mass of any two jets in the event; {vi) Mt, 
transverse mass the four leading jets. The normalized 
distributions of the selected kinematic variables for tt sig- 
nal and the M^-l-jets background are shown in Fig. [9] and 
Fig. [TO] for ^-|-jets and e+jets channels, respectively. A 
more detailed explanation of the used variables is given in 
Appendix [X] Figure [11] demonstrates that distributions 
in data of the kinematic variables selected as input to the 
discriminant are well described by the sum of expected 
ti signal, VK-|-jets and multijet background contributions 
for events with three jets dominated by the background. 
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The discriminant function is built according to Eq. 1111 
from the fits to the logarithm of the ratio of signal (tt) 
over background (M^+jets) based on simulated events. 
Finally, the fully defined discriminant function is evalu- 
ated for each physics process considered in this analysis. 
For this purpose, we use simulated tt events with ^-|-jets 
and dilepton final states, W/Z+jets events, and the mul- 
tijet background data sample selected by requiring that 
the lepton fails the tight selection criterion. An exam- 
ple of the discriminant distributions for the ti signal and 
main backgrounds in e-|-jcts channel is shown in Fig. 1121 
By construction, the discriminant function peaks near 
zero for the background, and near unity for the signal. 



IX. CROSS SECTION EXTRACTION 
A. Method 

The number of ti events in the selected data sample 
is extracted by performing a maximum likelihood fit to 
the discriminant distribution observed in data using tem- 
plates for the tt signal, multijet and W/Z+jets (W+iets) 
backgrounds in the //-f jets (e-|-jets) channel. The Z-|-jets 
contribution is added to the M^-|-jets discriminant tem- 
plate according to its fraction determined in Sect. IVlfBl 
resulting in a combined electroweak background template 
in the n+jets channel. Similarly, the contributions from 
the dilepton and £-|-jets ti signals are combined into a 
single ti template before fitting by adding the dilep- 
ton contribution to the ^-|-jets template. Dilepton and 
.Z-|-jets admixtures introduce only small corrections to 
the ti ^-|-jets and M^-|-jets template shape, respectively. 

We consider three different contributions to the maxi- 
mum likelihood fit: ti, M^/Z-|-jets and multijet, and con- 
strain the relative fraction of the latter using EqlS] This 
is realized by defining the following likelihood function: 



L{Nf,Nr,N^) 



where 'P{n,^) denotes the Poisson probability density 
function for n observed events given an expectation value 



of ji and Nj 



Nf are the numbers of tt, 



W/Z+iets 



and multijet events in the selected sample, respectively. 
In the first term of Eq. 1131 i runs over all bins of the dis- 
criminant histogram; n° is the content of bin i measured 
in the selected data sample; and fj-i is the expectation for 



bin i, which is a function of N^*. 



and as given 



by: 




ibb 266 366 466 S)0 
Ht [GeV] 




260 460 660 

Mr [GeV] 
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ffNf + frNr + f^N^, (14) 
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aplanarity 



where /** , , fi represent the fractions in bin i of the 
ti, W^-|-jets and multijet discriminant templates (shown 
in Fig. [T2] for e-fjets channel), respectively. The second 
term of Eq. [T2] effectively implements the constraint on 



FIG. 9: Distributions of the five variables used as input to the 
likelihood discriminant in the /i-fjets channel. The ti signal 
(solid line) and combined W/Z+jets electroweak backgrounds 
(dashed line) are derived from simulations. 
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aplanarity Jet r; max 



above the minimum while all other parameters of the 
fit are allowed to float. The results of the fits are listed 
in Table [3] and the corresponding correlation coefficients 
are summarized in Tabled) 



channel 


Nf 






e+jets 
^+jcts 


0' -0-12.9 

21 1+"''^ 

^-'-■-'--9.7 


72.61";? 


19.3 ±2.0 

8-1^1;? 



TABLE 3: Fitted number of tt, W+jets and multijet back- 
ground events in the selected sample in the e+jets and 
/i+jets channels. includes the .Z+jets contribution in 

the /i+jets sample. 



e+jets /i+jets 
Nf iVT iVf iVT W 

Nf +1.00 -0.63 -0.11 +1.00 -0.59 -0.14 
N^ +1.00 -0.23 +1.00 -0.23 

nI; +1.00 +1.00 



^0.15 



0.1 



0.05 







0) 

to 








o 

^0.15- 








1 o.i: 








0.05- 




n J 







- . . r. 





50 100 150 

minimum di-jet mass [GeV] 



200 



400 600 
Mt [GeV] 



FIG. 10: Distributions of the six variables used as input to the 
likelihood discriminant in the e+jets channel. The tt signal 
(solid line) and Vf^+jets background (dashed line) are derived 
from simulations. 



Nj^ via the Poisson probability of the observed number 
of events in the "loose— tight" (-/V^Lj sample, given the 
expectation (iVf_t). The latter can be expressed as: 



1 



'-Nf 



1 



'-n: 



1 



(15) 



Thus, the task is to minimize the negative log-likelihood 
function: 



-logL(iV, 
^(-< log ii^+fi^)- N^_t log Ni_t + 



i-t 



TABLE 4: Matrices of correlation coefficients of the likelihood 
fit in the e+jets and /t+jets channels. 



One complication arises due to the fact that the shape 
of the discriminant for the multijet background is ob- 
tained from the "loose— tight" data sample which has a 
small contribution from W^+jets and tt events (Eq. [T5|) . 
The contamination of the multijet template is taken into 
account by using the corrected expected number of events 
in each bin of the discriminant function 



M,(A^r,A^r,K) 



ffNf+frNr)x II 



Sb 1 



1 - £b £s 



+ 



n m + 



£b 1 



1 - £b £s 



Nf 
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w 



in place of the one of Eq. [141 

Figure [T3] shows the distributions of the discriminant 
functions for data in e+jets and /i+jets channels along 
with the fitted contributions from the ti signal and 
M^+jets and multijet backgrounds. 



B. Cross sections in individual channels 

The tt production cross section for an individual chan- 
nel j is computed as: 



where any terms independent of the minimization pa- 
rameters are dropped. The fitted parameters {Nf, Nf^, 
Njl) are given by their value at the negative log- likelihood 
function minimum, and their uncertainties are obtained 
by raising the negative log-likelihood by one-half unit 



NfU) 



(16) 



where Nf{j) is the number of fitted tt events in channel 
J, Bj is the branching fraction for the final state where 
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FIG. 11: Distributions of selected variables used as input to the discriminant in data overlaid with the predicted background 
and expected ti signal for the events with exactly three jets in the e+jets (top row) and /i+jets (bottom row) channels. 
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FIG. 12: Discriminant function output distributions for the 
ti signal (solid line), W-|-jets background (dashed line) and 
multijet events (dotted line) in the e+jets channel. 



the Icpton is allowed to originate either directly from a 
W boson or from the W ^ tv decay (Sycts); is the 
integrated luminosity and Ej is the tt selection efficiency. 



The efficiencies e are obtained by correcting the ti ^-|-jets 
selection efficiencies eijets for the ti dilepton final state 
contribution: 

£ = Eljots -I- „ , (17) 

t>ljots 

where eu and Be£ are the selection efficiency and the 
branching fraction for the ti — > ^£+jcts decay channel. 

The input values for the likelihood fit are summarized 
in Table El 



channel Ni Nt B 


c (pb-^) 


eijots (%) 


e{%) 


e + jets 242 119 0.17106 


425 


9.17 


9.39 


M + jets 160 100 0.17036 


422 


9.18 


9.36 



TABLE 5: Number of selected events in the loose (A'^) and 
tight (At) sample, branching fraction {B), integrated luminos- 
ity (£), selection efficiency for — > £+jets (eijots) and total 
selection efficiency (e) in e-|-jets and /j,-|-jets channels. 



The ti production cross sections at y/s =1.96 TcV for 
a top quark mass of 175 GeV in the e-|-jets and /i-|-jets 
channels are measured to be: 

e jets : au = 9.9li;g (stat) ± 1.0 (syst) ± 0.6 (lum) pb; 
Ai -f jets : au = 3.ll};| (stat) ± 0.4 (syst) ± 0.2 (lum) pb. 

We estimate the probability to observe the cross sec- 
tions measured in the ^-|-jets and e-|-jets channels by gen- 
erating pseudo-measurements using the expected statisti- 
cal and systematic uncertainties evaluated at an assumed 
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FIG. 13: Discriminant distribution for data overlaid with the 
result from a fit of tt signal, and VK+jets and multijet back- 
ground in the e+jets (upper plot) and /i+jets (lower plot) 
channel. 



C. Combined cross section 

The combined cross section in the lepton+jets channel 
is estimated by minimizing the sum of the negative log- 
likchhood functions of each individual channel. A total of 
five parameters are simultaneously fitted: cr^f (common 
to both lepton channels) and N^{j) and N^{j) sepa- 
rately for each channel. Figure [U shows the distribution 
of the discriminant function for data along with the fit- 
ted contributions from ti signal, W^-|-jets, and multijet 
background events which are found to be 40%, 48% and 
12%, respectively. The combined cross section for a top 
quark mass of 175 GeV is: 

crtt(175 GeV) = QAt\l (stat) ± 0.7 (syst) ± 0.4 (lum) pb. 
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FIG. 14: Discriminant distribution for data overlaid with the 
result from a fit of ti signal, and VK+jets and multijet back- 
ground in the combined i'-l-jets channel. 



true cross section of 7pb. We find that the correlation 
between two measurements is small, 3.3%, since the dom- 
inant uncertainties are of a statistical nature and there- 
fore uncorrelated between the channels. The probability 
to observe the tt cross section above 9.9 pb (below 3.1 pb) 
in one of the channels given the true cross section of 7 pb 
is 9% (4%). 

We estimate the consistency of the cross sections ob- 
served in the e-|-jets and /.i+jets channels by generating 
pseudo-experiments that incorporate all the correlations 
between the different sources of systematic uncertainties 
assuming the measured cross sections in the individual 
channels. From the distribution of the differences be- 
tween the e-l-jets and ^-fjets cross sections observed in 
each pseudo-experiment, we conclude that the cross sec- 
tions agree within 2.4 standard deviations. The differ- 
ence observed between the measured cross sections is at- 
tributed to a statistical fluctuation. 



We have studied the dependence of the measured cross 
section on the top quark mass by using the samples of 
simulated ti events with different top quark masses to 
evaluate signal efficiencies and discriminant function out- 
puts and repeating complete analysis. Figure [T51 shows 
the dependence of the combined cross section in the lep- 
ton-|-jets channel on the top quark mass. Solid line rep- 
resents the fit to the measured cross sections for various 
masses of the top quark. For 170 GeV< mtop <180 GeV 
the cross section changes as a function of mtop as: 

Oti{mtop) = cTtt - 0-1^ X ("itop - 175 GeV). (18) 

The kinematic distributions observed in lepton+jets 
events are well described by the sum of ti signal, 
M^/Z-|-jets, and multijet background contributions. An 
example of this agreement is illustrated in Figs. [1^] and 
[T7]for events selected requiring V < 0.5, i.e., dominated 
by background, and events in the ti signal region with 
V > 0.5. The two variables shown arc the lepton px and 
the highest jet px in the event and are not used as input 
to the discriminant function. 
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FIG. 15: The combined tt production cross section in the 
lepton+jets channel as a function of top quark mass compared 
to tlie theoretical calculations 13. 



X. SYSTEMATIC UNCERTAINTIES 

The systematic uncertainty on the tt production cross 
section in an individual channel j for each independent 
source of systematic uncertainty i is determined by vary- 
ing the source by one standard deviation up and down 
and propagating the variation into both the fitted num- 
ber of tt events and the signal efficiency resulting in a 
new value of the cross section in channel j: 



Nf{j) ± ANf{jY 



(19) 



Variations due to uncertainty sources which modify si- 
multaneously both the selection efficiency and the fitted 
number of tt events are treated as fully correlated. 

The variation of the fitted number of tt events due 
to each individual source i is estimated by generating 
10, 000 pseudo-experiments from simulated events. The 
sample composition of each pseudo-datasct is the same 
as the measured sample composition in data but allowing 
for Poisson fluctuations in the number of events from a 
specific contribution. The discriminant distribution for 
each pseudo-dataset is fitted in order to extract Nii once 
with the default discriminant function templates for tt^ 
H^/Z-|-jets and multijet background and once with the 
varied ones. The relative difference between the two re- 
sults is histogrammed. The relative systematic uncer- 
tainty is extracted from the histogram by performing a 
fit to a Gaussian distribution around the most probable 
value and using the mean of the fit as an estimator for 
the relative uncertainty on the fitted number of tt events 
from source i. 

Positive (negative) variations of the cross section Act] 
from each individual source of systematics with respect 
to the central value Uj fEq |19p arc summed quadratically 
to obtain total positive (negative) systematic uncertainty. 
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FIG. 16: Lepton pr distribution for i'-l-jets events in data with 
discriminant below 0.5 (upper plot) and discriminant above 
0.5 (lower plot), overlaid with the result from a fit of tt signal, 
and 'py+jets and multijet background. 



In addition, a systematic uncertainty of ±6.1% from the 
luminosity measurement is assigned [Tsj . By construc- 
tion, this method does not allow the systematic uncer- 
tainties to affect the central value of the cross section 

CTj. 

The systematic uncertainty on the combined cross sec- 
tion is estimated following the same procedure as de- 
scribed above taking into account the correlations be- 
tween individual sources of systematic uncertainties be- 
tween the channels. The systematic uncertainties are 
classified as either uncorrelated (usually of statistical ori- 
gin in either Monte Carlo simulation or data) or fully cor- 
related between the channels. In particular, we consider 
the systematic uncertainties coming from the primary 
vertex reconstruction, jet energy calibration, jet identi- 
fication, jet trigger, W background model, and branch- 
ing fraction to be fully correlated. Uncertainties associ- 
ated with the lepton identification, lepton trigger, mul- 
tijet background evaluation, and the limited statistics of 
Monte Carlo samples are taken as uncorrelated in the 
cross section combination. 
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FIG. 17; Leading jet pT distribution for f +jets events in data 
with discriminant below 0.5 (upper plot) and discriminant 
above 0.5 (lower plot), overlaid with the result from a fit of 
ti signal, and W+jets and multijet background. 



Table [S] summarizes the contributions from the vari- 
ous sources of systematic uncertainties to the total sys- 
tematic uncertainty on the cross sections in the e+jets, 
/i+jets and combined i+jets channels. The jet energy 
scale uncertainty dominates, followed by the uncertainty 
of the luminosity measurement. These two represent 80% 
of the total systematic uncertainty of the combined cross 
section. 



XI. SUMMARY 

We have measured the tt production cross section in 
the £+jcts final state by combining the measurements 
performed in the individual e+jets and /i+jets channels. 
The combined cross section for a top quark mass of 175 
GeV is 

£ + jets : att = G-^^IH (stat) ± 0.7 (syst) ± 0.4 (lum) pb. 

The result is in a good agreement with the theoretical 
predictions of 6.7j:g:^pb (0]) and 6.8±0.6pb (@) based 



Source 


e+jets 


M± 


jots 


^±jets 


Primary Vertex 


+0.30 - 0.28 


±0.12 


-0.10 


±0.24-0.21 


Lepton ID 


±0.32 


±0.17 


-0.16 


±0.22 


Jet Energy Scale 


+0.70 - 0.72 


±0.05 


- 0.16 


±0.47 


Jet ID 


+0.08-0.14 


±0.11 


- 0.02 


±0.03 - 0.08 


Trigger 


+0.05-0.21 


±0.09 


- 0.08 


±0.10-0.20 


W bckg model 


+0.11 - 0.21 


±0.13 


- 0.11 


±0.12-0.18 


Multijet bckg 


±0.04 


±0.13 


- 0.14 


±0.05 - 0.06 






±0 


.31 


ztU. 00 


B 


±0.20-0.19 


±0 


.06 


±0.14 


Subtotal 


±0.99 - 1.03 


±0.44 


±0.70 - 0.72 


Luminosity 


±0.64 


±0.20 


±0.42 


Total 


±1.18 - 1.21 


±0.45 


±0.82 - 0.83 



TABLE 6: Summary of systematic uncertainties of the cross 
section Aatt (pb). 



on the full NLO matrix elements and the resummation 
of the leading and next-to-leading soft logarithms. 
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APPENDIX A: KINEMATIC VARIABLES FOR 
DISCRIMINANT OPTIMIZATION 



We select a set of thirteen variables as input for the 
discriminant function optimization. These variables are 
designed to address different aspects of the tt signal and 
W±jets background kinematics: event energy, shape, 
location of the jets in the detector, properties of soft 
non-leading jets, etc. VF±jets background tends to have 
a lower event transverse energy, less energetic jets and 
smaller total invariant mass than tt events. Since the 
ti system is produced nearly at rest at the Tevatron and 
therefore is expected to have a much smaller boost in the 
beam direction than iy±jets, the jets from a tt event are 
more central. The tt event topology is also different from 
M^±jcts due to the different production mechanisms. 

We select the following thirteen variables for the dis- 
criminant fmiction optimization: 
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• Ht, the scalar sum of the pT of the four leading 
jets; 

• H'rp, Ht divided by the scalar sum of the absolute 
values of Pz of the jets, the lepton and the ]^t\ 

• My, transverse mass of the four leading jets; 

• A/ovcnt, invariant mass of up to four leading jets, 
the and the lepton in the event; 

• Event centrality C, defined as the ratio of the scalar 
sum of the px of the jets to the scalar sum of the 
energy of the jets; 

• Event aplanarity A = IA3 and sphericity S = 
|(A2 + A3), derived from the normalized momentum 

tensor, defined by Aiij ~ ^"^Vj^ , where p° is the 
momentum vector of jet o, i and j are Cartesian co- 
ordinates, and the eigenvalues A^ of M are ordered 
such that Ai > A2 > A3 with Ai + A2 + A3 = 1; 

• A(y9(£,^T), angle between the muon and the 
direction perpendicular to the beam axis; 

• l'?jet|'°''^, \'n\ of the jet with maximum pseudorapid- 

ity; 



• ^ 7y^, sum of the squared pseudorapidities of up to 
four jets; 

• NJW, built from the transverse momenta of up to 
four leading jets, it corresponds to the jet multi- 
plicity above a given jet pT threshold, over the 
range between 10 GeV and 55 GeV, weighted by 
the threshold, and is sensitive to the additional ra- 
diation in the event and to the pT spectrum of the 
jets in the event (soj : 

• Mj^^^, Sum of invariant masses of the three di- 
jet pairs formed from the three leading jets in the 
event; 

• the minimum of the invariant mass of any 
two jets in the event. 

Variables that characterize event energy scale (i?T, 
H'rp, Mti -Mevent) show the best discrimination power, 
but they are sensitive to the jet energy calibration, which 
is one of the dominant sources of systematic uncertainty 
on the tt cross section. A combination of variables be- 
longing to different classes provides the best total uncer- 
tainty on the cross section. 
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